Mproving S Tochastic G Radient D Escent with F Eedback

نویسنده

  • Hiroaki Hayashi
چکیده

In this paper we propose a simple and efficient method for improving stochastic gradient descent methods by using feedback from the objective function. The method tracks the relative changes in the objective function with a running average, and uses it to adaptively tune the learning rate in stochastic gradient descent. We specifically apply this idea to modify Adam, a popular algorithm for training deep neural networks. We conduct experiments to compare the resulting algorithm, which we call Eve, with state of the art methods used for training deep learning models. We train CNNs for image classification, and RNNs for language modeling and question answering. Our experiments show that Eve outperforms all other algorithms on these benchmark tasks. We also analyze the behavior of the feedback mechanism during the training process.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

T He I Mplicit B Ias of G Radient D Escent on S Epa - Rable D

We show that gradient descent on an unregularized logistic regression problem, for almost all separable datasets, converges to the same direction as the max-margin solution. The result generalizes also to other monotone decreasing loss functions with an infimum at infinity, and we also discuss a multi-class generalizations to the cross entropy loss. Furthermore, we show this convergence is very...

متن کامل

Identification of perceptual attributes constituting the user’s experience of mechanical switches

The Rep ertory Grid Techn ique was applied to identif y th e perceptual a ttributes cons tituting the users experience of mechanical switches for seven sensory conditions. Participants (105) were subjected to six bimodal and unimodal senso ry combinations c reated b y s electively res tricting the f low of haptic, auditory and visual information, plus one condition with full sensory information...

متن کامل

Amitraz Poisoning; A case study

A m i t r a z, a n i ns e c t i c i d e /a ca ri c i de of the f o r m a m i d i n e p e st i c i d e s group, is a ? 2 a d r e n e r g i c ag on i st a nd of t he a m i d i ne c h e m i ca l f a m il y generally us e d to c o n t r ol animal e c top a r a s i t e s. Poisoning due to am i t r a z i s r a r e and character...

متن کامل

On Hop Roman Domination in Trees

‎Let $G=(V,E)$ be a graph‎. ‎A subset $Ssubset V$ is a hop dominating set‎‎if every vertex outside $S$ is at distance two from a vertex of‎‎$S$‎. ‎A hop dominating set $S$ which induces a connected subgraph‎ ‎is called a connected hop dominating set of $G$‎. ‎The‎‎connected hop domination number of $G$‎, ‎$ gamma_{ch}(G)$,‎‎‎ ‎is the minimum cardinality of a connected hop‎‎dominating set of $G$...

متن کامل

The Calculation of 14N Quadrupole Coupling Constants with Basis Sets of High Local Quality

An appropria te basis set to obtain electric field gradients close to the H artree-Fock lim it is evaluated for the nitrogen nucleus. An em pirical correction for the error due to the neglect o f electron correlation is given for sp-. sp : and s p ’-hybrized nitrogen. T he results are used to predict the full quadrupole coupling tensor for a few molecules. A breakdow n of the electric field gra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016